The Bivariate Normal Copula 

Christian Meyer*''' 
December 15, 2009 

Abstract 

We collect well known and less known facts about the bivariate normal 
distribution and translate them into copula language. In addition, we 
prove a very general formula for the bivariate normal copula, we compute 
Gini's gamma, and we provide improved bounds and approximations on 
the diagonal. 

1 Introduction 

When it comes to modelling dependent random variables, in practice one often 
resorts to the multivariate normal distribution, mainly because it is easy to pa- 
rameterize and to deal with. In recent years, in particular in the quantitative 
finance community, we have also witnessed a trend to separate marginal distri- 
butions and dependence structure, using the copula concept. The normal (or 
Gauss, or Gaussian) copula has even come to the attention of the general public 
due to its use in the valuation of structured products and the decline of these 
products during the financial crisis of 2007 and 2008. 

The multivariate normal distribution has been studied since the 19th century. 
Many important results have been published in the 1950s and 1960s. Nowadays, 
quite some effort is spent on rediscovering some of them. 

In this note we will proceed as follows. We will concentrate on the bi- 
variate normal distribution because it is the most important case in practice 
(applications in quantitative finance include pricing of options and estimation 
of asset correlations; the impressive list by Balakrishnan & Lai (2009) men- 
tions applications in agriculture, biology, engineering, economics and finance, 
the environment, genetics, medicine, psychology, quality control, reliability and 
survival analysis, sociology, physical sciences and technology). We will give an 
extensive view on its properties. Everything will be reformulated in terms of 
the associated copula. 

We will also provide new results (at least if the author has not been redis- 
covering himself...), including a very general formula implying other well known 
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formulas, a derivation of Gini's gamma, and improved and very simple bounds 
for the diagonal. 

For collections of facts on the bivariate (or multivariate) distribution we refer 
to the books of Balakrishnan & Lai (2009), of Kotz, Balakrishnan & Johnson 
(2000), and of Patel & Read (1996), and to the survey article of Gupta (1963a) 
with its extensive bibliography (Gupta 1963b). For theory on copulas we refer 
to the book of Nclscn (2006). 

We will use the symbols P, E and V for probabilities, expectation values and 
variances. 



2 Definition and basic properties 

Denote by 

<f(x) ■= exp (~y) i H h ) : = J ¥>0) dx 

the density and distribution function of the standard normal distribution, and 
by 

1 fx 2 - 2gxy + y z 

ip 2 (x, y;g) := exp 



2n^T^ "V 2(1-^) 



/h rk 
/ (fi 2 (x,y;g) dy dx 
-oo J — oo 



the density and distribution function of the bivariate standard normal distribu- 
tion with correlation parameter g e (—1,1). The bivariate normal (or Gauss, 
or Gaussian) copula with parameter g is then defined by application of Sklar's 
theorem, cf. Section 2.3 of (Nelsen 2006): 

C(u,v;g) :=$ 2 ($-i( u ),$->);p) (2.1) 

For g € {-1,1}, the correlation matrix of the bivariate standard normal distri- 
bution becomes singular. Nevertheless, the distribution, and hence the normal 
copula, can be extended continuously. Wc may define 

C(u,v,— 1) := lim C(u, v; g) — max(u + v — 1, 0), (2.2) 

Q 

C(u, v, +1) := lim C(u, v; g) = min(w, v). (2.3) 

Q > + l 

Hence C(-, •; g), for g — > —1, approaches the lower Frechet bound, 

W(u, v) := max(u + v — 1, 0), 
and, for g — ► 1, approaches the upper Frechet bound, 

M(u, v) := min(u, v). 
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Furthermore, we have 

C(u,v;0) =u-v =:IL(u,v), (2.4) 

the independence copula. What happens in between can be described using the 
following differential equation derived by Plackett (1954): 

d d 2 

— $ 2 (x, y; g) = <p 2 (x, y; g) = ^T^ $2 ( x ' V' & ( 2 - 5 ) 

We find 

C(u, v; g) - C(u,v;a) = I(u, v;a, g) := f ip 2 ( < J>~ 1 (u), r) dr, (2.6) 

J a 

and in particular, 

C(u, v; g) = W(u, v) + I(u, v; -1, g), (2.7) 

= IL(u,v) + I(u,v;0,g), (2.8) 

= M(u,v)-I(u,v;q,1). (2.9) 

In other words, the bivariate normal copula allows comprehensive total concor- 
dance ordering with respect to g, cf. Section 2.8 of (Nelsen 2006): 

W(; ■) = C(; ■■ -1) -< C(; •; g) ~< C(; •; a) ~< C(; ■; 1) - M(; •) 

for — 1 < g < a < 1, i.e., for all u, v, 

W{u, v) = C{u, v; -1) < C(u, v; g) < C{u, v; a) < C{u, v, 1) = M(u, v). 
By plugging u = v — \ into (|2.8[) we obtain 

c (i he) = \ + [ 2nV l_ r2 dr = \ + h arcsin(e) ' (2 ' 10) 

a result already known to Stieltjes (1889). 

Mehler (1866) and Pearson (1901a), among other authors, obtained the so- 
called tetrachoric expansion in g: 

C(u, v; g) = uv + g (^(u)) g (^(v)) £ Re k (^(u)) He fc (^(v)) JL— - 

(2.11) 

where 

[fc/2] 



He fc (a;) = ^ 1 



t «(*-2i)! V 2 



are the Hermite polynomials. 
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The bivariate normal copula inherits the symmetries of the bivariate normal 
distribution: 

C(u,v;g) = C(v,u;g) (2.12) 
= u-C{u,l-v;-g) (2.13) 
= v-C(1-u,v;-q) (2.14) 
= u + v-l + C(l-u,l-v;g) (2.15) 

Here, (12 . 1 2[) is a consequence of exchangeability, and (|2.15|) is a consequence of 
radial symmetry, cf. Section 2.7 of (Nelsen 2006). 

In the following sections we will discuss formulas for the bivariate normal 
copula, its numerical evaluation, bounds and approximations, measures of con- 
cordance, and univariate distributions related to the bivariate normal copula. 
It will be convenient to assume g ^ { — 1,0, 1} unless explicitly stated otherwise 
(cf. (|2.2[) , l|2.4p . (|2.3[) for the simple formulation in the missing cases). 



3 Formulas 

If (X, Y) are bivariate standard normally distributed with correlation g then 
Y conditional on X = x is normally distributed with expectation gx and vari- 
ance 1 — g 2 . This translates into the following formulas for the conditional 
distributions of the bivariate normal copula: 

The copula density is given by: 

, x 3 2 y 2 (<S>~\u),<S>-\v);g) 

C(M ' ^ ^ - = d^¥v C{U ' ^ ^ = <p ($-!(„)) V ($-!(«)) (3 - 3) 

1 / 2g$- 1 {u)<p- 1 (v) - g 2 (^(u) 2 + ^(v) 2 ) \ 

" ^=7 exp [ W=V J 

In terms of its copula density, the bivariate normal copula can be written as 

f-U f-V 

C(u,v;g)= / / c(s , t; g) dt ds . (3-4) 
Jo Jo 
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Now let a, (3, 7 e (—1, 1) with a/37 — £?• Then the following holds: 



C(u,v; g) 



OO />00 



x /<p- 1 (u)-a-x\ ^ ($- 1 (v)-0-y\ . . , , ,„ „, 



JO 



jf (j^C(u,s-,a)) (J^C(t,v;f3)^c( S ,t ;i )dtd s 

The right hand side of (|3.5p has occurred in credit risk modelling but without 
the link to the bivariate normal copula, cf. (Bluhm & Overbeck 2003). A proof 
in that context will be provided in Section [A. 11 

Other formulas for C(u, v; g) are obtained by carefully studying the limit as 
some of the variables a, /3, 7 are approaching the value one. The interesting 
cases, not regarding symmetry, are listed in Table [T] 



a 


P 


7 


reference 


= a 


= /3 


= 7 




= 1 


= 


= 7 


(EH 


= a 


= 


= 1 


(22) 


= 1 


= Q 


= 1 


EU) 


= 1 


= 1 


= Q 


@3) 



Table 1: Limiting cases of (|3.5 
By approaching a = 1 in ()3.5() we obtain: 



C(u,v;g)= I / $ LLJ- U <p 2 (x,yn) dy dx (3.6) 

-00 J-00 y \J\ — p A ) 

" rl * /*~ 1 (f)-/3-$- 1 (*)\ , v j j 



However, Equation (|3 . 6[) may be considered rather unattractive. By approaching 
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7 = 1 in (|3.5[) instead we obtain a more interesting formula: 

C{u, v;g) = $ V J = * 7=== <P(*) dz ( 3 - 7 ) 



y/1 -a 2 J \ yi - ^ 3 



VI - a 2 J \ a/1 -/3 2 
^C(t,v;a)) ^C(u,i;/3)) eft 

Equation (13. 7p seems to have been discovered repeatedly, sometimes in more 
general context (e.g., multivariate, cf. (Steck & Owen 1962)), sometimes for 
special cases. Gupta (1963a) gives credit to Dunnett & Sobel (1955). 
By approaching a — 7 = 1 in (|3 . 5[) we obtain: 

C{u, v; g) = jf * ( - 1( ^L£-!~ ^ (ft = jf ^C(t, v; g) dt (3.8) 

These formulas can also be derived from (|3.ip and (|3.2p . Finally, by approaching 
a = = 1 in (|3.5p we rediscover (|3.4|) . 

In order to evaluate the bivariate normal distribution function numerically, 
Owen (1956) defined the following very useful function, to which we will refer 
as Owen's T-function: 

T(h,a) = -J Q _ dx = <p(h) ^ — 2 dx (3.10) 

He proved that 

C(u,v-g) = — T a u ) — T ($>~ x (v), a v ) — 5(u, v) (3.11) 
where 

st \ fh> if u <\' v >\ or u>\,v<\ 

:= n , ( 3 - 12 ) 
I 0, else 

and 

au = 7^=7 {*=W ~ g ) ' av = ~ v ' (3 ' 13) 

In particular, on the lines defined by v = ^ and by u = v, the following formulas 
hold: 

C («, §; f>) = \ - T U-\u), -^=f ) . (3- 14 ) 



r {ll .u. L » = ll -2T[^- l in).Jl-^i) (8.15 1 
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From (|3.11[) and (|3.14[) we can derive the useful formula 

C(u, v;g) = C (u, \; g u ) + C (v, |; g v ) - 8(u, v), (3.16) 

where 

sin(arctan(— a u )), 



at. 



q v = = = sin(arctan(— a v )). 

v 1 + a l 



On the diagonal u = v, (|3.16j) reads: 



C(u,u;q)=2-C[u,1;-J^-} (3.17) 



Inversion of (|3 . 1 T[) using (|2.13[) gives: 

C(u,h e ) = My} 1 - 2 < >) > 2 , e< °' (3.18) 

V 2 ' \u- \C{u,u-1~2q 2 ), g>0. y ' 

Applying ([575]) to (j3~T7) we obtain, cf. also (Steck & Owen 1962), 

C(u,u;g) =2 - / 5 (t; p) eft (3.19) 



with 



g {u .g)~$(j±-A.<S>-\ U )y (3.20) 

We find 

-^C(u,u;g)=2-g{u;g). (3.21) 

The function g will become important in Section \5\ Note that if U, V are 
uniformly distributed on [0, 1] with copula C(-, •; g) then 

0(u;g) = P(V < u| [/ = u). 

Below we list some properties of 

(3.22) 
(3.23) 
(3.24) 

;g), (3.25) 
(3.26) 

In particular, (|3.22[) and (|3.23[) show that the bivariate normal copula does not 
exhibit (lower or upper) tail dependence (cf. Section 5.2.3 of Embrechts, Frey 
& McNeil (2005)). 

Substitution of t = g(s; g) in (|3.19| and application of (|3 . 26(1 lead to the 
identity, cf. also (Steck & Owen 1962): 

C(u, u; g) = 2u- g(u; g) - C (g(u; g), g(u\ g); -g) (3.27) 



lim g(u;g) 


= 0, 


u >0+ 




lim g(u;g) 

u >1 


= 1, 




i 

2 ' 


g(l -u;g) 


= 1 - 


g{g(u;g);-g) 


= U 
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4 Numerical evaluation 



The bivariate normal copula has to be evaluated numerically. To my knowledge, 
in the literature there is no direct approach. Hence for the time being we have 
to rely on the definition of the bivariate normal copula (|2.1[) and numerical eval- 
uation of $2 and <j> _1 . There are excellent algorithms available for evaluation of 
cf. (Acklam 2004); the main problem is evaluation of the bivariate normal 
distribution function $2- 

In the literature on evaluation of $2 there are basically two approaches: 
application of a multivariate method to the bivariate case, and explicit consid- 
eration of the bivariate case. For background on multivariate methods we refer 
to the recent book by Bretz & Genz (2009). In most cases, bivariate methods 
will be able to obtain the desired accuracy in less time. In the following we 
will provide an overview on the literature. We will concentrate on methods 
and omit references dealing with implementation only. Comparisons of differ- 
ent approaches in terms of accuracy and running time have been provided by 
numerous authors, e.g., Agca & Chance (2003), Terza & Welland (1991), and 
Wang & Kennedy (1990). 

Before the advent of widely available computer power, extensive tables of the 
bivariate normal distribution function had to be created. Using (|3 . 16[) or similar 
approaches, the three-dimensional problem (two variables and the correlation 
parameter) was reduced to a two-dimensional one. 

Pearson (1901a) used the tetrachoric expansion (|2.11[) for small \g\, and 
quadrature for large \g\. Nicholson (1943), building on ideas of Sheppard (1900), 
worked with a two-parameter function, denoted l^-function. Owen (1956) intro- 
duced the T-function (|3.10[) which is closely related to Nicholson's F-function. 
For many years, quadrature of the T-function was the method of choice for 
evaluation of the bivariate normal distribution. Numerous authors, e.g. Borth 
(1973), Daley (1974), Young & Minder (1974), and Patefield & Tandy (2000), 
have been working on improvements, e.g. by dividing the plane into many 
regions and choosing specific quadrature methods in each region. 

Sowden & Ashford (1969) applied Gauss-Hermite quadrature to (|3.7|) and 
Simpson's rule to (|3.8|) . Drezner (1978) used (13.16|) and Gauss quadrature. Divgi 
(1979) relied on polar coordinates and an approximation to the univariate Mills' 
ratio. Vasicek (1998) proposed an expansion which is more suitable for large \g\ 
than the tetrachoric expansion (|2. 1 1|) . 

Drezner & Wesolowsky (1990) applied Gauss-Legendre quadrature to (|2.8[) 
for \g\ < 0.8, and to (|2.9[) for \g\ > 0.8. Improvements of their method in terms 
of accuracy and robustness have been provided by Genz (2004) and West (2005). 

Most implementations today will rely on variants of the approaches of Divgi 
(1979) or of Drezner & Wesolowsky (1990). The method of Drezner (1978), 
although less reliable, is also very common, mainly because it is proposed in 
(Hull 2008) and other prevalent books. 
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5 Bounds and approximations 



Nowadays, high- precision numerical evaluation of the bivariate normal copula 
is usually available and there is not much need for low-precision approximations 
anymore, in particular if the mathematics is hidden behind strange constants 
derived from some optimization procedure. 

On the other hand, if the mathematics is general and transparent then the 
resulting approximations are often rather weak. As an example, take approxi- 
mations to the multivariate Mills' ratios applied to the bivariate case, cf. (Lu 
& Li 2009) and the references therein. 

Bounds, if they are not too weak, are more interesting than approximations 
because they can be used, e.g., for checking numerical algorithms. Moreover, 
derivation of bounds often provides valuable insight into the mathematics behind 
the function to be bounded. 

In the following we concentrate on bounds and approximations explicitly 
derived for the bivariate case. Throughout this section we will only consider the 
case g > 0, < u = v < 1/2. Note that by successively applying, if required, 
(|3.16[) . (|3.18|) , (|3.27[) and (|2. 15[) . we can always reduce C(u, v; g) to a sum of two 
terms of that form. Any approximation or bound given for the special case can 
be translated to an approximation or bound for the general case, with at most 
twice the absolute error. Note also that for many existing approximations and 
bounds the diagonal u ~ v may be considered a worst case, cf. (Willink 2004). 

Mee & Owen (1983) elaborated on the so-called conditional approach pro- 
posed by Pearson (1901b). If (X, Y) are bivariate standard normally distributed 
with correlation g then we can write 

<f> 2 (h,k;g) = -P(Y < k\X < h). (5.1) 

The distribution of Y conditional on X = h is normal but the distribution of Y 
conditional on X < h is not. Nevertheless, it can be approximated by a normal 
distribution with the same mean and variance. In terms of the bivariate normal 
copula, the resulting approximation is 

nt s */' u-$>-\u) + g-y(§-\u)) \ 

C(u, u; g) w u ■ $ . . (5.2) 

\ ^u 2 - g 2 ■ <p ($-!(u)) • (u ■ $-!(«) + V (*" 1 H)) / 

The approximation works well for \g\ not too large. For \g\ large there are 
alternative approximations, e.g. (Albers & Kallenberg 1994). 

The simpler of the two approximations proposed by Cox & Wermuth (1991) 
replaces the second factor in (|5.1j) by the mean of the conditional distribution 
(the more complicated approximation adds a term of second order). In terms 
of the bivariate normal copula, the resulting approximation is 



C(u, u; g) s» u ■ $ 



l-Q u- + g- (p ($" 1 (m)) 

(1 + q)-u 
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Mallows (1959) gave two approximations to Owen's T-function (|3.10[) . In 
terms of the bivariate normal copula, the simpler one reads 



1 — q ( ( u 1 \ ^ R _ 1 ( 3 



Further approximations were derived by Cadwell (1951) and Polya (1949). 
There are not too many bounds available in the literature. For g > there 
are, of course, the trivial bounds (12.21) and (12.31): 



u 2 < C(u, u; g) < u 

The upper bound given by Polya (1949) is just the one above. His lower bound is 
too weak (even negative) on the diagonal. A recent overview on known bounds, 
and derivation of some new ones, is provided by Willink (2004). We will present 
some of his bounds below in more general context. 

Theorem 5.1 Let g > and < u < 1/2. Then C(u,u;g) is bounded as 
follows, where g(u; g) is defined as in H3.20\) : 

C(u,u; g) > u ■ g{u; g) (5-3) 
C(u,u; g) < u ■ g(u; g) ■ 2 (5.4) 



The lower bound \5.3\) is tight for g = or u = 0. The maximum error of {57, 
equals 1/4 and is obtained for u = 1/2, g = 1. The upper bound \5.Ji\ ) is tight 
for g — 1 or u = 0. The maximum error of |5.^[ ) equals 1/4 and is obtained for 
u = 1/2,0 = 0. 

A proof of Theorem 15.11 is provided in Section IA.21 together with a proof of 
the following refinement: 

Theorem 5.2 Let g > and < u < 1/2. Then C(u,u;g) is bounded as 
follows, where g(u; g) is defined as in \3.20\) : 

C(u, u; g) > u ■ g(u; g) ■ (l -\ — arcsin(f?)J (5-5) 

C(u, u; q) < u ■ g(u; q) ■ (1 + q) (5.6) 

These bounds are the optimal ones of the form u ■ g{u) ■ a{g). They are tight for 
g = 0, g = 1, or u = 0. The maximum error of i5. 6]) is obtained for u = 1/2, 



g = \ 1 — % « 0.7712, the value being 



i^l--i-^arcsin^l-Ajj Rj 0.05263. 
The lower bound \5.5\) is tight for u = 1/2. 
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The bounds (|5.3[) and (|5.6p have been discussed, without explicit computa- 
tion of the maximum error, by Willink (2004). The maximum error of (|5.5I) is 
difficult to grasp analytically. Numerically, the error always stays below 0.006. 

An alternative upper bound is given by the following theorem, the proof of 
which is provided in Section [A. 31 

Theorem 5.3 Let g > and < u < 1/2. Then 

C(u,u;g) < 2u-g(~;g) . 

The bound is tight for g = 0, g = 1, or u = 0. The maximum error is obtained 
for u= 1/2, 

l + g _( jl - g 



2^. $-i(i) ^\\l+g 
i.e., g w 0.5961, the value being approx. 0.015. 

It is also possible to derive good approximations to C(u, u; g) by considering 
the family C(u, u; g) ss u ■ g(u; g) ■ {a(g) + b(g)u). In particular, the choice 

(2 \ 4 

a(g) := 1 + g, b(g) := 2 • 1 H — arcsin(g) — (1 + g) I = — arcsin(g) — 2g 

\ TT J TT 

is attractive because the resulting approximation 

C{u, u; g) ~ u ■ g(u\ g) ■ ^1 + g + ^— arcsin(g) — 2g^j ii^j (5-7) 

is tight for g = 0, g = 1, u = 0, or u = 1/2, and for u — > + it has the same 
asymptotic behaviour as (|5.6p . By visual inspection we may conjecture that it 
is even an upper bound, with an error almost cancelling the error of the lower 
bound (|5.5p most of the time. Consequently, an even better approximation (but 
not a lower bound, for g large) is given by 

C(u, u; g) ~ u ■ g(u; g) ■ f 1 + ^ + ^— arcsin(g) — g^j u J . (5-8) 

Again, (|5.8p is tight for g = 0, g = 1, u = 0, or u = 1/2. Numerically, 
the absolute error always stays below 0.0006. Hence (|5.8p is comparable in 
performance with (15. 2p . and much better for g large. 



6 Measures of concordance 

In the study of dependence between (two) random variables, properties and 
measures that are scale- invariant, i.e., invariant under strictly increasing trans- 
formations of the random variables, can be expressed in terms of the (bivariate) 
copula of the random variables. Among these are the so-called measures of 
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concordance, in particular Kendall's tau, Spearman's rho, Blomqvist's beta and 
Gini's gamma. For background and general definitions and properties we refer to 
Section 5 of (Nelsen 2006). In this section we will provide formulas for measures 
of concordance for the bivariate normal copula, depending on the correlation 
parameter g. 

Blomqvist's beta follows immediately from (|2.10D : 

0(g) :=4-C(§,£;g)-l 
2 

= — arcsin(£>) (6-1) 

TT 

For the bivariate normal copula, Kendall's tau equals Blomqvist's beta: 



r(g) :=4 f f 
J o Jo 



1-4 
2 



C(u, v; g) dC(u, v; g) — 1 

/ / (Jiv c<yU,v;g ^) (JL G ^ u ' v ' g ^) dudv 



TT 



arcsin(g) (6.2) 



For a proof of dHUJ) and cf. Section 5.3.2 of (Embrechts et al. 2005). Both 
Blomqvist's beta and Kendall's tau can be generalized to (copulas of) elliptical 
distributions, cf. (Lindskog, McNeil & Schmock 2003). This is not the case for 
Spearman's rho, cf. (Hult & Lindskog 2002), which is given by: 



g s (g) := 12 / / 
Jo Jo 



C(u, v; g) — uv du dv 

= 12 / C(u, v; g) du dv — 3 
Jo Jo 

(I) M 



6 

— arcsin 

TT 



For proofs of cf. (Kruskal 1958) or Section 5.3.2 of (Embrechts et al. 2005). 
Gini's gamma for the bivariate normal copula is given as follows: 

7(g) := 4 ( I C(u, u; g) du + j C(u 7 1 — u; g) du 



2 

= 4 ( f C(u, u; g) du + j u — C(u, u; —g) du 

\Jo Jo 2 

— I arcsm I — arcsin I I (6-4) 



tt V V 2 / V 2 

— I arcsm I — I — arcsin I — - — I I (d-5) 

1 arcsin Q ( V(l + e)(3 + g) - y/(l - g)(3 - gj^j (6.6) 
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I have not been able to find proofs in the literature, hence they will be provided 
in Section [PI 

Equation (|6.6[) can be inverted which may be useful for estimation of g from 
an estimate for 7(g): 



g = sm 



(7(0) • \) ^3 -tan (7(0) ■ |) (6.7) 



Finally, we propose a new measure of concordance, similar to Gini's gamma 
but based on the lines u = 1/2, v = 1/2 instead of the diagonals. For a bivariate 
copula C it is defined by: 



7 (C(, •)) := 4 (7* cM)-\du + fc{\ iV )- V - dv 
(J c(u,%) du + J C{\,v) dv-^j 



(6.8) 



= 4 

For the bivariate normal copula we obtain 



7 (e) :=7(C(-, •;*?)) = -arcsinf^ ) . ((,<>) 



7T 



A proof is given implicitly in Section IA.4I 



7 Univariate distributions 

In this section we will discuss two univariate distributions being closely related 
to the bivariate normal copula (or distribution). 

7.1 The skew-normal distribution 

A random variable X on K is skew-normally distributed with skewness parameter 
A e R if it has a density function of the form 

h(x) = 2<p(xM\x). (7.1) 

The skew-normal distribution was introduced by O'Hagan & Leonard (1976) and 
studied and made popular by Azzalini (1985, 1986). Its cumulative distribution 
function is given by 

/x /><&(x) 
2ip(x)<f>(\x) = 2 / $ (A<i>~ 1 (i)) dt. (7.2) 
-00 Jo 
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In the light of (|3.19[) and (|3.17[) . cf. also (Azzalini & Capitanio 2003), we find 
V(X<x)=2$ 2 (x,0 ] - ^L- ? ) (7.3) 

A>0, 

/ l-A 2 \ {1A) 

l-^2^-x,-x- YT ^y A<0. 

In particular, the bounds given in Theorem 15.21 can be applied. 

7.2 The Vasicek distribution 

A random variable P on the interval [0, 1] is Vasicek distributed with parameters 
p G (0, 1) and g € (0, 1) if $ _1 (P) is normally distributed with mean 

E^P =^= (7-5) 

and variance 

y($-\P)) = * (7.6) 
1 - Q 

In Section [A. II implicitly it is proved that 

E(P)=p, E(P 2 ) = C(p,p;e), 

so that 

V(P) = E(P 2 ) - E(P) 2 = C(p,p; g) - p 2 . 
Furthermore, we have 

P(P <q)= P(*-l(P) < *"l(g)) = $ f ^- r ^'^ 
The (one-sided) a-Quantile q a of P, with a £ (0, 1), is therefore given by 

fc -»( ^-*"^±*"'W ). (7.7) 



In particular, the median of P is simply 
The density of P is 



q . 5 = ^(^M) =$(E($- 1 (P)))- (7.8) 



Sp< ff ) = J^-/^-*~ 1(?) -*~ 1(p) 
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The distribution is unimodal with the mode at 



for g < 0.5, monotone for g — 0.5, and U-shaped for g > 0.5. 
Let P be Vasicek distributed with parameters p, g, and let 

corr (^- 1 (P),^- 1 (P)\ = 7. 

Then 

cov (^- 1 (P),^- 1 (P) S j = 7 • 

and 

cov fp, p) = C (p,p; 7 • Vfi" ^) - P ■ P- 

The Vasicek distribution does not offer immediate advantages over other 
two-parametric continuous distributions on (0, 1), such as the beta distribution. 
Its importance stems from its occurrence as mixing distribution in linear factor 
models set up as in Section IA.ll It is a special case of a probit-normal distri- 
bution; it is named after Vasicek who introduced it into credit risk modeling. 

For (different) details on the material in this section we refer to Vasicek 
(1987, 2002) and Tasche (2008). Estimation of the parameters p and g is also 
discussed by Meyer (2009). 



1 — g 1 — g 
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A Proofs 

A.l Proof of (SUD 

Let 



X = a ■ Y + a/1 - a 2 ■ e, 

X = [3 ■ Y + \/l-/3 2 ■ i, 

where a, (3 € (—1, 1) \ {0} are parameters and where Y, Y, e, e are all standard 
normal and pairwise independent, except 



7 := corr \ Y, Yj — cov \ Y, Yj 
By construction, X and X are standard normal again with 
corr (x, x) = cov (x, x) = 0^7. 



We define indicator variables Z = Z(X) € {0, 1}, Z = Z(X) e {0, 1} calibrated 
to expectation values u, v: 

Z=l :<S=^ X^^iu), 

Z=l X^^iv) 
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Conditional on (Y = y, Y = y), Z and Z are independent. We find 

F(Z = 1 1 Y = y) = P (X < $- x (u) | Y = y) 
= P (a ■ y + \/l -a 2 ■ e < 



Now we define the random variables 

- a ■ y s 



P := P(T) := $ 
P := P(Y) := $ 



VI -a 2 

$- 1 («)-/3-y 



We find 

u = E(Z) = E(E(Z | y)) = E(F(Z = 1 1 Y)) = E(P), w = E(P) 

and 

r Z=l,Z=l) = f(x < < 



= (^$- 1 (m),$- 1 (w),cov(X,X) 
= ($- 1 (u),$- 1 (w),a/3 7 ) . 

On the other hand, 

Z = \,Z = \\ = P(ZZ = 1) = E[ZZ 



i(zz I y y)) = e (e(z | y y) • e(z \ y y)) 
e (e(z I y) ■ e(z l y)) = e (p ■ P) 



oo /•oo 



-OO — OO 
OO /"OO 



oo « — oo 



P(x) ■ P{y) ■ cp 2 (x, y, 7) dx dy 



A. 2 Proof of Theorems D and B 

We will assume g as fixed and write 

C(u) := C(u,u;q), g(u) := g(u; g). 
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The upper bound (|5.4[) follows from (13.27P . Regarding the lower bound (|5.3|) 
we note that 



An) = ^ 



^^..exp^.^^O, 

1 + g tp(9 L {u)j 
Hence g is increasing and concave on (0, 1/2) and we conclude 

C{u) = 2 J g{t) dt>2-yu- g(u) = u ■ g(u). 

Now we define 

D a {u) := u ■ g(u) ■ a 
with a G [1, 2], u G [0, 1/2]. We start by noting that 

C'(u) = 2g(u) > 0, C"(u) = 2g'(u) > 0, 

hence C is increasing and convex on (0, 1/2). Furthermore, 

fl',(«) = o.( J (t()+ii.j'W)>0 1 
D'^u) = a-(2-g'{u)+u-g"{u)) 

-it 



2a • g'(u) • 1 + — : / ' 

2a- ff '(u)-('l- T ^- J ff(-$- 1 ( U )) 



with -ff(a;) = a; • where 

1 - $(a>) 



i?(.x) = 



is Mills' ratio. Pinelis (2002) has shown that iJ'(x) > for x > 0, ff(0) = 0, 

and lima; >ooH(x) = 1. Hence D a (u) is increasing and convex on (0,1/2) as 

well. 

For u G (0, 1/2), C"(it) = D' a (u) is equivalent with 

u-g» 2- a 2 
/(«):= r^— = , or a = 

We will show that / is strictly increasing on (0, 1/2). We have 
with A = x f]=i G [0, 1] and 
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We find 

R'(x) ■ R(X ■ x) - R(x) ■ R'(X ■ x) 



R(X-x) 2 
'R'(x) , R'(X-x)\ 



for A < 1. Here we have used that F\(x) > and that the function 

R'(y) 

is strictly decreasing on (0,oo), cf. (Baricz 2008). Wc conclude 

f{u) = -x • - ;/ > o. 

Furthermore, we find 



and 



g'(u) + u ■ g"(u) 



lim f{u) = lim 



u — >o+ ' u — >o+ g'( u ) 

= hm H — • — 

u >0+ 1 + Q ip(^- 1 (u)) 

= lim 1 __?£_. £f (_$-!(«)) 
_ 1 2g _ 1 - g 

i + e i + g' 

We have C(0) = D o (0) = 0, C"(0) = ££(0) = 0, and 



« — >o+ C(u) « — >o+ C"(m) « — >o+ 2 1 + 

By standard calculus we conclude that 

• For a > 1 + gwe have D' a (u) > C'(u), and hence D a (u) > C(u), for all 
ue [0,1/2]; 

• For a < 2 • (l + y / ^f) we have D'Ju) < C'(u), and hence D a (u) < 
C{u), for all u e [0,1/2]; 

• For 

-i \ 
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there exists u £ (0, 1/2) with D' a {u) < C'(u) for u £ (0, u ), and D' a (u) > 
C'(u) for u £ (uo,l/2). Consequently, the best lower bound for C of 
the form D a is obtained if C(l/2) = D a (l/2), i.e., a = 1 + - arcsin(g). 
Moreover, the upper bound D a with a = 1 + g can not be improved. 



The maximum error of D a with a = 1 + g is attained if 
[A,(l/2) - C(l/2)] = - - i = 



which is equivalent with g = y 1 — ^ . 

A.3 Proof of Theorem ET51 

We will assume g as fixed and write 

:= C(u,u;g), g(u) := g(u; g). 

We have 

C(u) = 2 • / g(t) dt = 2u- g(v(u)) 
Jo 

with 

v(u) :— v(u; g) < u. 
Since g is concave and increasing, we even know that 

and hence 

2 

Moreover, for the same reason we have 



and hence, for g fixed, the maximum error is obtained for u = 1/2, the value 
being 

~ 1 ~-ifA\ i i 



Derivation of the above expression with respect to g gives the result. 
Note that by (|3.26|) we can write 

(C{u) 
v{u; g) = g\ — — ; -g 



2u 

Unfortunately, for large g, the function v is not convex, and the approximation 

'u\ C{\) 



2u- g 

is not an upper bound for C(u). 



g{\) 
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A.4 Proof of (gap , (1631) . (l6~T6l) 

In a first step, using (|2.8[) we find: 



C(u, u;g) du = u + — / exp — dr du 

1 1 f e 1 



, , V>(v) exp — dv dr 



— I / — , / u>(s)\ — — — ds dr 

3 2W0 VT^J-00 13 + r 

1 1 f e 1 



3 2tt7 v /(l- r )(3 + r) 
dr 



dr 



l + e 

1 1 1 



We conclude that 



3 2tt 7 1 - r 2 

11/. /l + g\ . /I 

— I arcsm — arcsm - 

3 2tt V V 2 J \2 

1 1 . fl + Q 

arcsm 



2tt V 2 



7(g) = 4 ( I C(u,u;g)du+ I u — C(u,u;—g) du — — 
Jo 2 



, , 1 1 . /i + e\ 11 1 ■ /1 - £»\ 1 

4 — I arcsm M arcsm 



4 2f V 2 / 24 2 7r \2/2 

2 / . /l + A . /I - Q 



arcsm | — ■ — ] - - arcsui 



7T V \ 2 J V 2 

In a similar way, using again (|2.8|) . we can compute 



~( 1 \ j f'u 1 f e I ( <P-\u) 2 



1 I . 



= — arcsm _ , . 

4 2vr 

which, using (|3 . 1 7|) . leads to 

y C(w, u; g) du — 2 ■ J C (^u, |; — ^ U = 2 arcsm ^ 

We obtain alternative formulas for Gini's gamma, the second one using the 
addition theorem for the arcsin function: 

( v 4 / . /VTT]?\ . ^a/i - g 

j(g) = — I arcsin I — I — arcsin 



4 

— arcsin 

7T 



Q(V(i + 0)(3 + eW(i-0)(3-e))) 
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